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Fundamental to the design of reliable, high-performance network services is an 
understanding of the performance characteristics of the service as perceived by the client 
population as a whole. Understanding and measuring such end-to-end service performance 
is a challenging task. Current techniques include periodic sampling of service characteristics 
from strategic locations in the network and instrumenting Web pages with code that reports 
client-perceived latency back to a performance server. Li ... 
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Web link analysis has proven to be a significant enhancement for quality based web search. 
Most existing links can be classified into two categories: intra-type links (e.g., web 
hyperlinks), which represent the relationship of data objects within a homogeneous data 
type (web pages), and inter-type links (e.g., user browsing log) which represent the 
relationship of data objects across different data types (users and web pages). 
Unfortunately, most link analysis research only considers one type of ... 

Keywords: data fusion, information retrieval, link analysis algorithms, link fusion 
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We present a fast compression technique for natural language texts. The novelties are that 
(1) decompression of arbitrary portions of the text can be done very efficiently, (2) exact 
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search for words and phrases can be done on the compressed text directly, using any known 
sequential pattern-matching algorithm, and (3) word-based approximate and extended 
search can also be done efficiently without any decoding. The compression scheme uses a 
semistatic word-based model and a Huffman code wher ... 

Keywords: compressed pattern matching, natural language text compression, word 
searching, word-based Huffman coding 



4 DB-IR-2 (databases and information retieval): web and XML text search: Providing 

consistent and exhaustive relevance assessments for XML retrieval evaluation 
Benjamin Piwowarski, Mounia Lalmas 

November 2004 Proceedings of the Thirteenth ACM conference on Information and 
knowledge management 

Full text available: ^pdf(300.16 KB) Additional Information: full citation , abstract , references , index terms 

Comparing retrieval approaches requires test collections, which consist of documents, 
queries and relevance assessments. Obtaining consistent and exhaustive relevance 
assessments is crucial for the appropriate comparison of retrieval approaches. Whereas the 
evaluation methodology for flat text retrieval approaches is well established, the evaluation 
of XML retrieval approaches is a research issue. This is because XML documents are 
composed of nested components that cannot be considered indepe ... 

Keywords: INEX, XML, evaluation, relevance assessment process 
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Keywords: Biclustering, simultaneous clustering, coclustering, subspace clustering, 
bidimensional clustering, direct clustering, block clustering, two-way clustering, two-mode 
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Experiments with a component theory of probabilistic information retrieval based on 
single terms as document components 

K. L. Kwok 

October 1990 ACM Transactions on Information Systems (TOIS), volume 8 issue 4 

Full text available- f g| pdfd.84 MB) Additional Information: full citation , abstract , references , citings , index 
^ terms , review 

A component theory of information retrieval using single content terms as component for 
queries and documents was reviewed and experimented with. The theory has the 
advantages of being able to (1) bootstrap itself, that is, define initial term weights naturally 
based on the fact that items are self relevent; (2) make use of within-item term 
frequencies; (3) account for query-focused and document-focused indexing and retrieval 
strategies cooperatively; and (4) allow for component-specific fe ... 

Keywords: document-focused and query-focused relevance feedback, indexing and 
retrieval, inverse collection term frequency weighting, inverse document frequency 
weighting, probabilistic indexing, probabilistic retrieval, ranking and weighting of composite 
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7 A statistically emergent approach for language processing: application to modeling 
context effects in ambiguous Chinese word boundary perception 
Kok-Wee Gan, Kim-Teng Lua, Martha Palmer 
December 1996 Computational Linguistics, volume 22 issue 4 

Full text available: . 



^pdf(1.51 MB) Additional Information: full citation , abstract , references , citings 
Publisher Site 

This paper proposes that the process of language understanding can be modeled as a 
collective phenomenon that emerges from a myriad of microscopic and diverse activities. 
The process is analogous to the crystallization process in chemistry. The essential features 
of this model are: asynchronous parallelism; temperature-controlled randomness; and 
statistically emergent active symbols. A computer program that tests this model on the task 
of capturing the effect of context on the perception of ambi ... 

Summarization and question answering: Using librarian techniques in automatic text 
summarization for information retrieval 
Min-Yen Kan, Judith L. Klavans 

July 2002 Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries 

p ii* ^ ui a ma ac kab\ Additional Information: full citation , abstract , references , citings , index 

Full text available: TSa pdf(1.15 MB) 

terms 

A current application of automatic text summarization is to provide an overview of relevant 
documents coming from an information retrieval (IR) system. This paper examines how 
Centrifuser, one such summarization system, was designed with respect to methods used in 
the library community. We have reviewed these librarian expert techniques to assist 
information seekers and codified them into eight distinct strategies. We detail how we have 
operationalized six of these strategies in Centrifuser by c ... 

Keywords: automatic text summarization, information retrieval user interfaces, reference 
librarian techniques 



9 Structural computing: Unifying structure, behavior, and data with themis types and 
templates 

William Van Lepthien, Kenneth M. Anderson 

August 2004 Proceedings of the fifteenth ACM conference on Hypertext & hypermedia 

Full text available: ^ pdf(332.32 KB) Additional Information: full citation , abstract , references , index terms 

Structural computing evolved from work on open hypermedia to aid in the creation of 
software infrastructure. Open hypermedia had produced software that provided applications 
with access to hypermedia structures and services. The question was asked if these results 
could be generalized to create similar tools for other domains. Initial work focused on the 
development of structure servers that can create and manipulate domain-specific structures, 
but little work focused on allowing those structure ... 

Keywords: Chimera, Themis, structural computing, templates, types 
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June 2002 Proceedings of the 39th conference on Design automation 



c g e cf c 



Results (page 1): destinations 1 same object$l same retriev$4 same categor$4 same search$4 same word$4 Page 4 o 
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Routing is an important problem in the process of design creation. In this paper, we focus 
on the problem of designing a database for the non-partitioned routing problem. New 
technology libraries describe constraints that are hard to manage in grid-based approaches 
to the routing database. While general region query based data-structures have been 
proposed, they typically suffer from speed problems when applied to large blocks. We 
introduce an interval-based approach. It provides more flexibilit ... 

Keywords: database, physical design, routing 



12 Skeletal/medial axis representations: Any open bounded subset of has the same 
homotopv type than its medial axis 
Andre Lieutier 

June 2003 Proceedings of the eighth ACM symposium on Solid modeling and 
applications 

Full text available: ^pdf(419.63 KB) Additional Information: full citation , abstract, references , index terms 

Medial Axis Transform is sometimes used as an intermediate representation in algorithms 
for meshing or recognition of shapes from digitized data. This raises the question whether 
the Medial Axis captures fundamental topological invariants of the object. The (positive) 
answer has been known already in the case of smooth objects. The main result presented 
here is the homotopy equivalence of any bounded open subset of R n with its Medial Axis. 

Keywords: homotopy, medial axis, skeleton, surface reconstruction 



13 Common Lisp Object System specification 

Daniel G. Bobrow, Linda G. DeMichiel, Richard P. Gabriel, Sonya E. Keene, Gregor Kiczales, 
David A. Moon 

September 1988 ACM SIGPLAN Notices, Volume 23 Issue SI 

Full text available: ^ pdf(6.88 MB) Additional Information: full citation , citings 



14 Fast detection of communication patterns in distributed executions 
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies 
on Collaborative research 

Full text available: ^pdf(4.21 MB) Additional Information: full citation , abstract , references, index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based on 
process-time diagrams are often used to obtain a better understanding of the execution of 
the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not provide 
the user with the desired overview of the application. In our experience, such tools display 
repeated occurrences of non-trivial commun ... 

15 On completeness of historical relational query languages 
James Clifford, Albert Croker, Alexander Tuzhilin 

March 1994 ACM Transactions on Database Systems (TODS), Volume 19 issue l 

Full text available- 1S pdf(3.55 MB) Additional Information: full citation , abstract , references , citings , index 

terms, review 

Numerous proposals for extending the relational data model to incorporate the temporal 
dimension of data have appeared in the past several years. These proposals have differed 
considerably in the way that the temporal dimension has been incorporated both into the 
structure of the extended relations of these temporal models and into the extended 
relational algebra or calculus that they define. Because of these differences, it has been 
diffic ... 
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16 An object-based programming model for shared data 
Gail E. Kaiser, Brent Hailpern 

April 1992 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 14 Issue 2 

Full text available* f£|pdf(3 .28 MB) Additional Information: full citation , abstract , references , index terms . 
^ review 

The classical object model supports private data within objects and clean interfaces between 
objects, and by definition does not permit sharing of data among arbitrary objects. This is a 
problem for real-world applications, such as advanced financial services and integrated 
network management, where the same data logically belong to multiple objects and may be 
distributed over multiple nodes on the network. Rather than give up the advantages of 
encapsulated objects in modeling real-world en ... 

Keywords: coordination language, daemons, financial applications, object-based, real-time, 
sharing 



17 Concurrency control in advanced database applications 
NaserS. Barghouti, Gail E. Kaiser 

September 1991 ACM Computing Surveys (CSUR), volume 23 issue 3 

Full text available: ^ pdf(4,69 MB) Additional Information: full citation , references , citings , index terms 



Keywords: advanced database applications, concurrency control, cooperative transactions, 
design environments, extended transaction models, long transactions, object-oriented 
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18 Optimal structure identification with greedy search 
David Maxwell Chickering 

March 2003 The Journal of Machine Learning Research, Volume 3 

Full text available - fil pdf(462 82 KB) Additional Information: full citation , abstract , references , citings , index 
^ : terms 

In this paper we prove the so-called "Meek Conjecture". In particular, we show that if a DAG 
H is an independence map of another DAG G, then there exists a finite sequence of edge 
additions and covered edge reversals in G such that (1) after each edge modification H 
remains an independence map of G and (2) after all modifications G =H. As shown by Meek 
(1997), this result has an important consequence for Bayesian approaches to learning 
Bayesian ne ... 

19 Programming languages for distributed computing systems 
Henri E. Bal, Jennifer G. Steiner, Andrew S. Tanenbaum 
September 1989 ACM Computing Surveys (CSUR), Volume 21 issue 3 

Full text available: ffi Pdf(6.50 MB) Additional Information: full citation , abstract, references , citings, index 
^ terms , review 

When distributed systems first appeared, they were programmed in traditional sequential 
languages, usually with the addition of a few library procedures for sending and receiving 
messages. As distributed applications became more commonplace and more sophisticated, 
this ad hoc approach became less satisfactory. Researchers all over the world began 
designing new programming languages specifically for implementing distributed applications. 
These languages and their history, their underlying pr ... 
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20 Software error analysis: a real case study involving real faults and mutations Q 
Murial Daran, Pascale Thevenod-Fosse 

May 1996 ACM SIGSOFT Software Engineering Notes , Proceedings of the 1996 ACM 
SIGSOFT international symposium on Software testing and analysis, volume 
21 Issue 3 

Full text available: ^ pdffl.20 MB) Additional Information: full citation , abstract, references , citings, index 
^ terms 

The paper reports on a first experimental comparison of software errors generated by real 
faults and by lst-order mutations. The experiments were conducted on a program 
developed by a student from the industrial specification of a critical software from the civil 
nuclear field. Emphasis was put on the analysis of errors produced upon activation of 12 real 
faults by focusing on the mechanisms of error creation, masking, and propagation up to 
failure occurrence, and on the comparison of these erro ... 
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